UPV/BUAP Participation in WebCLEF 2006
نویسندگان
چکیده
After our first participation in the Bilingual task of WebCLEF 2005, we have emigrated to a more challenging task. In this report we are presenting the results obtained after evaluating a set of topics in the Mixed-Monolingual task of WebCLEF 2006. Our efforts were focused on the preprocessing of the EuroGOV corpus which is itself a very challenging task, due to the high variety of errors that must be treated in order to correctly interpret the content of each document to index. Moreover, we have tested a new formula for the ranking of the documents retrieved, which is based on the Jaccard formula but includes a penalization factor. Results are low but encourage to investigate whether they are the result of a bad preprocessing process and/or the malfunction of the search engine components.
منابع مشابه
BUAP-UPV TPIRS: A System for Document Indexing Reduction at WebCLEF
In this paper we present the results of BUAP/UPV universities in WebCLEF, a particular task of CLEF 2005. Particularly, we evaluate our information retrieval system at the bilingual “English to Spanish” task. Our system uses a term reduction process based on the Transition Point technique. Our results show that it is possible to reduce the number of terms to index, thereby improving the perform...
متن کاملTPIRS: A System for Document Indexing Reduction on WebCLEF
In this paper we present the results of BUAP/UPV universities in WebCLEF, a particular task of CLEF 2005. Particularly, we evaluate our information retrieval system in the bilingual English to Spanish track. Our system uses a term reduction process based on the Transition Point technique. Our results show that it is possible to reduce the number of terms to index, thereby improving the performa...
متن کاملDublin City University at WebCLEF 2007
This paper describes our participation in the Multilingual Web Track (WebCLEF) 2007.
متن کاملThe University of Amsterdam at WebCLEF 2006
Our aim for our participation in WebCLEF 2006 was to investigate the robustness of information retrieval techniques to crosslingual retrieval, such as compact document representations, and query reformulation techniques. Our focus was on the mixed monolingual task. Apart from the proper preprocessing and transformation of various encodings, we did not apply any language-specific techniques. Ins...
متن کاملThe University of Amsterdam at WebCLEF 2005
We describe the University of Amsterdam’s participation in the WebCLEF track at CLEF 2005. We submitted runs for both the mixed monolingual task and the multilingual task.
متن کامل